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Figure 1 - Human STR_50E1 - SEQ ID NO:1 

Nucleotide sequence of long splice variant 
[initiation ATG and stop codons are underlined] 

GGGCTCCCTG CACAAATGCG TTGGGTGATG GGGGCTGAAT . CCAGCCCACA CTGCACTTGC CAAGCCAGCT 7 0 

GGGGCCCTGG CACAAGACAG TCCCAGCCTG TTTTCACTGA CTTTGCTAAT TCTCACGGAG GCACCATGTG 14 0 

GTGTGGGAAG GCCCGGTCCT CGTAACCTCT CTGCTCCCAG GTCCCTGACC AGTCCTTAAC ACACAGTGGT 210 

CTTTGCTCAC CTGCGGCCCA GCTCTGGGCT CTCCCCACAG CATCCTTTGC CTTGCCTCCC TCCCATCTTC 2 80 

CTCTGGGCCT TCTCTCTGCT CCTGCCCAGG AAACTGTGCT CTCAG GAG CG CAGGAGCCAG CTCTCAGCCC 3 50 

CCATCTCCTG GGCACTCACC GTACTCAGGA AATATGTTCT GAATTCAGGA TTATCCTCAT TCTACTGAGA 4 20 

AGACCTGGAG GACAGAAATC AGCAAGACCT AAAGGGGAGA GGAAGGAGGG CCAGGCTGGG GTGGAGGTGC 4 90 

CCCACCCGGG AGCCCGGGCG CAGCCTCACC GCAGGCTGAT TCACAGAAGG CTCAGAGGGT TGCGAGGGCC 560 

CAATCGGCAC TGTCATCCTG CCCAGGCTCT GAGTCACCAG CTGGTGAGGG GCAGCTGCAG CCCAGCAGGA 630- 

AACAAAGTCT AGCATGGAAG AGGTGGGAGG GAGGTGGTGG GGCCTGAAAC CCCGCCTGGC TGGCCTTAGA 700 

GGAACTGGGA GTGACTGTCC GGCACTGGCT CAGCAGCAAA CAGCTCTCAA GGACGTGCTA GGAGTCAGGA 77 0 

ACTGGGCCAG CTCCGGTCCC TTCCTTTTGG GGCTCTCACT CTGGAGGATG GGGTGGATGG GAGGTCAGAG 840 ' 

GAGCACCAGC CTATGGCCCT GGACACCTGG GGTATTCAGC GAGTTCCTGG AGGACGGTGG GATGGGGCTG 910 

TGGTTCCAGC AAGAAAAAAC CGGGAAGATC CTGACGGAGT TCCTCCAGTT CTATGAAGAC CAGTATGGCG 980 

TGGCTCTCTT CAACAGCATG CGCCATGAGA TTGAGGGCAC GGGGCTGCCG CAGGCCCAGC TGCTCTGGCG 10 50 

CAAGGTGGCA CTGGACGAGC GCATCGTCTT CTCGGGGAAC CTCTTCCAGC ACCAGGAGGA CAGCAAGAAG 112 0 

TGGAGAAACC GCTTCAGCCT CGTGCCCCAC AACTACGGGC TGGTGCTCTA CGAAAACAAA GCGGCCTATG- 1190 

AGCGGCAGGT CCCACCACGA GCGGTCATCA ACAGTGCAGG CTACAAAATC CTCACGTCCG TGGACCAATA 12 60 

CCTGGAGCTC ATTGGCAACT CCTTACCAGG GACCACGGCA AAGTCGGGCA GTGCCCCCAT CCTCAAGTGC 1330 

CCCACACAGT TCCCGCTCAT CCTCTGGCAT CCTTATGCGC GTCACTACTA CTTCTGCATG ATGACAGAAG 14 00 

CCGAGCAGGA CAAGTGGCAG GCTGTGCTGC AGGACTGCAT CCGGCACTGC AACAATGGAA TCCCTGAGGA 14 7 0 

CTCCAAGGTA GAGGGCCCTG CGTTCACAGA TGCCATCCGC ATGTACCGAC AGTCCAAGGA GCTGTACGGC 154 0 

ACCTGGGAGA TGCTGTGTGG GAACGAGGTG CAGATCCTGA GCAACCTGGT GATGGAGGAG CTGGGCCCTG 1610 

AGCTGAAGGC AGAGCTCGGC CCGCGGCTGA AGGGGAAACC GCAGGAGCGG CAGCGGCAGT GGATCCAGAT 1680 

CTCGGACGCC GTGTACCACA TGGTGTACGA GCAGGCCAAG GCGCGCTTCG AGGAGGTGCT GTCCAAGGTG 1750 

CAGCAGGTGC AGCCGGCCAT GCAGGCCGTC ATCCGAACTG ACATGGACCA AATTATCACC TCCAAGGAGC 182 0 

ACCTTGCCAG CAAGATCCGA GCCTTCATCC TCCCCAAGGC AGAGGTGTGC GTGCGGAACC ATGTCCAGCC 18 90 

CTACATCCCA TCCATCCTGG AGGCCCTGAT GGTCCCCACC AGCGAGGGCT TCACTGAGGT GCGAGATGTC 19 60 

TTCTTCAAGG AGGTCACGGA CATGAACCTG AACGTCATCA ACGAGGGCGG CATTGACAAG CTGGGCGAGT 2 030 
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ACATGGAGAA GCTGTCCCGG CTGGCGTACC ACCCCCTGAA GATGCAGAGC TGCTATGAGA AGATGGAGTC 2100 

GCTGCGACTG GACGGGCTGC AGCAGCGATT TGATGTGTCC AGCACGTCCG TGTTCAAGCA GCGAGCCCAG 217 0 

ATCCACATGC GGGAGCAAAT GGACAATGCC GTGTATACGT TCGAGACCCT CCTGCACCAG GAGCTGGGGA 2240 

AGGGGCCCAC CAAGGAGGAG CTGTGCAAGT CCATCCAGCG GGTCCTGGAG CGGGTGCTGA AAAAATACGA 2 310 

CTACGACAGC AGCTCTGTGC GGAAGAGGTT CTTCCGGGAG GCGCTGCTGC AGATCAGCAT CCCGTTCCTG 2*38 0 

CTCAAGAAGC TGGCCCCTAC CTGCAAGTCG GAGCTGCCCC GGTTCCAGGA GCTGATCTTC GAGGACTTTG 2 4 50 

CCAGGTTCAT CCTGGTGGAA AACACGTACG AGGAGGTGGT GCTGCAGACC GTCATGAAGG ACATCCTGCA 2 52 0 

GGCTGTGAAG GAGGCCGCGG TGCAGAGGAA GCACAACCTC TACCGGGACA GCATGGTCAT GCACAACAGC 2 5 90 

GACCCCAACC TGCACCTGCT GGCCGAGGGC GCCCCCATCG ACTGGGGCGA GGAGTACAGC AACAGCGGCG 2 660 

GGGGCGGCAG CCCCAGCCCC AGCACCCCGG AGTCAGCCAC CCTCTCGGAA AAGCGACGGC GCGCCAAGCA 2 7 30 

GGTGGTCTCT GTGGTCCAGG ATGAGGAGGT GGGGCTGCCC TTTGAGGCTA GCCCTGAGTC ACCACCACCT 2800 

GCGTCCCCGG ACGGTGTCAC TGAGATCCGA GGCCTGCTGG CCCAAGGTCT GCGGCCTGAG AGCCCCCCAC 28 7 0 

CAGCCGGCCC CCTGCTCAAC GGGGCCCCCG CTGGGGAGAG TCCCCAGCCT AAGGCCGCCC CCGAGGCCTC 2940 

CTCGCCGCCT GCCTCACCCC TCCAGCATCT CCTGCCTGGA AAGGCTGTGG . ACCTTGGGCC CGCCAAGCCC 3010 

AGCGACCAGG AGACTGGAGA GCAGGTGTCC AGCCCCAGCA 'GCCACCCCGC CCTCCACACC ACCACCGAGG 3080 

ACAGTGCAGG GGTGCAGACT GAGTTCTAGG CCAGTGGGTC CCTGACTGCT GCACATGGCA GAGGCCGTTC 3150 

CCTTCCGGAC CCAGGGAGGC TCAGCTCTGG GGAGGGCACC CTGGTCTGTG CCTTGTGGGT GGAGGCGGGG' 3220 

CAGGGCTGTG TGGCACCGCC AGGGAGCGGG CCCACCTGAG TCACTTTATT GGGTTCAGTG AACACTTTCT 32 90 

TGCTCCCTGT TTTCTCTTCT GTGGGATGAT CTCAGATGCA GGGGCTGGTT TTGGGGTTTT CCTGCTTGTG 3360 

CCAAGGGGTG GACACTGCTG GGGGGCTGGA AA'GCCCCTCC CTTCCTGTCC TTCTGTGGCC -TCCATCCCCT 3430 

CATGGGTGCT GCCATCCTTC CTGGAGAGAG GGAGGTGAAA GCTGGTGTGA GCCCAGTGGG TTCCCGCCCA 3500 

CTCACCCAGG AGCTGGCTGG GCCAGGACCG GGAGAGGGAG CACTGCTGCC CTCCTGGCCC TGCTCCTTCC 357 0 

GCAGTTAGGG GTGGACCGAG CCTCGCTTTC CCCACTGTTC TGGAGGGAAG GGGAAGGAGG GGGTCTTCAG 3640 

GCTGGAGCCA GGCTGGGGGT GCTGGGTGGA GAGATGAGAT TTAGGGGGTG CCTCATGGGG TGGGCAGGCC 3710 

TGGGGTGAAA TGAGAAAGGC CCAGAACGTG CAGGTCTGCG GAGGGGAAGT GTCCTGAGTG AAGGAGGGGA 3 780 

CCCCATCCTG GGGATGCTGG GAGTGAGTGA GTGAGATGGC TGAGTGAGGG TTATGGGGAG CCTGAGGTTT 3850 

TATGGGCCTG TGTATCCCCT TCTCCCGGCC CCAGCCTGCC TCCCTCCTGC CCGCCTGGCC CACAGGTCTC "3920 

CCTCTGGTCC CTGTCCCTCT GGTGGTTGGG GATGGAGCGG CAGCAAGGGG TGTAATGGGG CTGGGTTCTG 3990 

TCTTCTACAG GCCACCCCGA GGTCCTCAGT GGTTGCCTGG GGAGCCGGAC GGGGCTCCTG AGGGGTACAG 4 060 

GTTGGGTGGG CCCTCCCTGA GGGTCTGGGG TCAGGCTTTG GCCTCTGCTG CCTCTCAGTC ACCAAGTCAC 4130 

CTCCCTCTGA AAATCCAGTC CCTTCTTTGG ATGTCCTTGT GAGTCACTCT GGGCCTGGCT GTCGTCCCTC 4 200 

CTCAGCTTCT TGTTCCTGGG ACAAGGGTCA AGCCAGGATG GGCCCAGGCN TGGGATCCCC CACCCCAGGA 427 0 

CCCCACAGGC CCCCTCCCCT - GNTGNTTTGC GGGGGGCAGG GCAGAAATGG ACTCCTTTTG GGTCCCCGAG ' 4340 

GTGGGGTCCC CTCCCAGCCC TGCATCCTCC GTGCCCTAGA CCTGCTCCCC AGAGGAGGGG CCTTGACCCA 4 410 
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CAGGAAGTGT GGTGGCGCCT GGCAATCAGG GACCCCCAGC TGCCGCAGCC CTGGTTTTTG GCGCATCTTT 44 8 0 
TCCCTCTTGT CCCGAAGATT TGCGCCTTTA GTGCCTTTTG AGGGGTTCCC ATCATCCCTC CCTGATATTG 4 550 
TATTGAAAAT ATTATGCACA CTGTTCATGC TTTTACTAAT CAATAAACGC TTTATTTAAA AAAAAAAAAA 4620 
AAA 4 62 3 
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Figure 2 - Human STR_50E1 - SEQ ID NO:2 

Predicted polypeptide of long splice variant 
(Alternatively-spliced expn is marked) 
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Figure 3 - Human STR_50E1 - SEQ ID NO:3 

Nucleotide sequence of short splice variant 
(Initiation ATG and stop codons are underlined) 

GGGCTCCCTG CACAAATGCG TTGGGTGATG GGGGCTGAAT CCAGCCCACA CTGCACTTGC CAAGCCAGCT 7 0 
GGGGCCCTGG CACAAGACAG TCCCAGCCTG TTTTCACTGA CTTTGCTAAT TCTCACGGAG- GCACCATGTG > 140 

GTGTGGGAAG GCCCGGTCCT CGTAACCTCT CTGCTCCCAG GTCCCTGACC AGTGCTTAAC ACACAGTGGT 210 

CTTTGCTCAC CTGCGGCCCA GCTCTGGGCT CTCCCCACAG CATCCTTTGC CTTGCCTCCC TCCCATCTTC 2 80 

CTCTGGGCCT TCTCTCTGCT CCTGCCCAGG AAACTGTGCT CTCAGGAGCG CAGGAGCCAG CTCTCAGCeC. 350 

CCATCTCCTG GGCACTCACC GTAGTCAGGA AATATGTTCT GAATTCAGGA TTATCCTCAT TCTACTGAGA 4 20 

AGACCTGGAG GACAGAAATC AGCAAGACCT AAAGGGGAGA GGAAGGAGGG CCAGGCTGGG GTGGAGGTGC 4 90 

CCCACCCGGG AGCCCGGGCG CAGCCTCACC GCAGGCTGAT TCACAGAAGG CTCAGAGGGT TGCGAGGGCC 560- 

CAATCGGCAC TGTCATCCTG CCCAGGCTCT GAGTCACCAG CTGGTGAGGG GCAGCTGCAG CCCAGCAGGA 630 

AACAAAGTCT AGCATGGAAG AGGTGGGAGG GAGGTGGTGG GGCCTGAAAC CCCGCCTGGC TGGCCTTAGA 7 00 

GGAACTGGGA GTGACTGTCC GGCACTGGCT CAGCAGCAAA CAGCTCTCAA GGACGTGCTA GGAGTCAGGA 7 70 

ACTGGGCCAG CTCCGGTCCC TTCCTTTTGG GGCTCTCACT CTGGAGGATG GGGTGGATGG GAGAAAAAAC 84 0 

CGGGAAGATC CTGACGGAGT TCCTCCAGTT CTATGAAGAC CAGTATGGCG TGGCTCTCTT CAACAGCATG 910 

CGCGATGAGA TTGAGGGCAC GGGGCTGCCG CAGGCCCAGC TGCTCTGGCG CAAGGTGCCA CTGGACGAGC 98 0 

s GCATGGTCTT CTCGGGGAAC CTCTTCCAGC ACCAGGAGGA CAGCAAGAAG TGGAGAAACC GCTTCAGCCT 1050 

CGTGCCCCAC AACTAGGGGC TGGTGCTCTA CGAAAACAAA GCGGCCTATG AGCGGCAGGT CCCACCACGA 112 0 

GCCGTCATCA ACAGTGCAGG CTACAAAATC CTCACGTCCG TGGACCAATA CCTGGAGCTC ATTGGCAACT 1190 

CCTTACCAGG GACCACGGCA AAGTCGGGCA GTGCCCCCAT CCTCAAGTGC CCCACACAGT TCCCGCTCAT 12 60 

CCTCTGGCAT CCTTATGCGC GTCACTACTA CTTCTGCATG ATGACAGAAG CCGAGCAGGA CAAGTGGCAG 1330 

GCTGTGCTGC AGGACTGCAT CCGGCACTGC AACAATGGAA TCCCTGAGGA CTCCAAGGTA GAGGGCCCTG 14 00 

CGTTCACAGA TGCCATCCGC ATGTACCGAC AGTCCAAGGA GCTGTACGGC ACCTGGGAGA ■ TGCTGTGTGG 14 7 0 

GAACGAGGTG CAGATCCTGA GCAACCTGGT GATGGAGGAG CTGGGCCCTG AGCTGAAGGC AGAGCTCGGC 154 0 

CCGCGGCTGA AG GGG AAACC GCAGGAGCGG CAGCGGCAGT GGATCCAGAT CTCGGACGCC GTGTACCACA 1610 

TGGTGTACGA GCAGGCCAAG GCGCGCTTCG AGGAGGTGCT GTCCAAGGTG CAGCAGGTGC AGCCGGCCAT 168 0 

GCAGGCCGTC ATCCGAACTG ACATGGACCA AATTATCACC TCCAAGGAGC ACCTTGCCAG.. CAAGATCCGA 17 50 

GCCTTCATCC TCCCCAAGGC AGAGGTGTGC GTGCGGAACC ATGTCCAGCC CTACATCCCA TCCATCCTGG 18 2 0 

AGGCCCTGAT GGTCCCCACC AGCCAGGGCT TCACTGAGGT GCGAGATGTC TTCTTCAAGG AGGTCACGGA 18 90 

CATGAACCTG AACGTCATCA ACGAGGGCGG CATTGACAAG CTGGGCGAGT ACATGGAGAA GCTGTCCCGG 1960 

CTGGCGTACC ACCCCCTGAA GATGCAGAGC TGCTATGAGA AGATGGAGTC GCTGCGACTG GACGGGCTGC 2030 
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AGCAGCGATT TGATGTGTCC AGCACGTCCG TGTTCAAGCA GCGAGCCCAG ATCCACATGC GGGAGCAAAT 2100 

GGACAATGCC GTGTATACGT TCGAGACCCT CCTGCACCAG GAGCTGGGGA AGGGGCCCAC CAAGGAGGAG 217 0 

CTGTGCAAGT CCATCCAGCG GGTCCTGGAG CGGGTGCTGA AAAAAT AC G A CTACGACAGC AGCTCTGTGC 2 24 0. 

GGAAGAGGTT CTTCCGGGAG GCGCTGCTGC AGATCAGCAT CCCGTTCCTG CTCAAGAAGC TGGCCCCTAC 2 310 



AACACGTACG- AGGAGGTGGT GCTGCAGACC GTCATGAAGG ACATCCTGCA GGCTGTGAAG GAGGCCGCGG 2450 

TGCAGAGGAA GCACAACCTC TACCGGGACA GCATGGTCAT GCACAACAGC GACCCCAACC TGCACCTGCT 2520 

GGCCGAGGGC GCCCCCATCG ACTGGGGCGA GGAGTACAGC AACAGCGGCG GGGGCGGCAG CCCCAGCCCC 2590 

AGCACCCCGG AGTCAGCCAC CCTCTCGGAA AAGCGACGGC GCGCCAAGCA GGTGGTCTCT GTGGTCCAGG 2 660 

ATGAGGAGGT GGGGCTGCCC TTTGAGGCTA GCCCTGAGTC ACCACCACCT GCGTCCCCGG ACGGTGTCAC 2 7 30 

TGAGATCCGA GGCCTGCTGG CCCAAGGTCT GCGGCCTGAG AGCCCCCCAC CAGCCGGCCC CCTGCTCAAC 2800 

GGGGCCCCCG CTGGGGAGAG TCCCCAGCCT AAGGCCGCCC CCGAGGCCTC CTCGCCGCCT GCCTCACCCC 2 870 

TCCAGCATCT CCTGCCTGGA AAGGCTGTGG ACCTTGGGCC GCCCAAGCCC AGCGACCAGG AGACTGGAGA 2 94 0 

GCAGGTGTCC AGCCCCAGCA GCCACCCCGC CCTCCACACC ACCACCGAGG ACAGTGCAGG GGTGCAGACT 3010 

GAGTTC TAG G CCAGTGGGTC CCTGACTGCT GCACATGGCA CAGGCCGTTC CCTTCCGGAC CCAGGCAGGC 3080 

TCAGCTCTGG GGAGGGCACC CTGGTCTGTG CCTTGTGGGT GGAGGCGGGG CAGGGCTGTG TGGCACCGCC 3150 

AGGGAGCGGG CCCACCTGAG TCACTTTATT GGGTTCAGTC AACACTTTCT TGCTCCCTGT TTTCTCTTCT 32 2 0 

GTGGGATGAT CTCAGATGCA GGGGCTGGTT .TTGGGGTTTT CCTGCTTGTG CCAAGGGCTG- GACACTGCTG 3290 

GGGGGCTGGA AAGCCCCTCC CTTCCTGTCC TTCTGTGGCC TCCATCCGCT CATGGGTGCT GCCATCCTTC 3360 

CTGGAGAGAG GGAGGTGAAA GCTGGTGTGA GCCCAGTGGG' TTCCCGCCCA CTCACCCAGG AGCTGGCTGG 3430 

GCCAGGACCG GGAGAGGGAG CACTGCTGCC CTCCTGGCCC TGCTCCTTCC GCAGTTAGGG GTGGACCGAG 350 0 

CCTCGCTTTC CCCACTGTTC TGGAGGGAAG GGGAAGGAGG GGGTCTTCAG. GCTGGAGCGA GGCTGGGGGT- 3570 

GCTGGGTGGA GAGATGAGAT TTAGGGGGTG CCTCATGGGG TGGGCAGGCC TGGGGTGAAA TGAGAAAGGC. 3 64 0 

CCAGAACGTG CAGGTCTGCG GAGGGGAAGT GTCCTGAGTG AAGGAGGGGA CCCCATCCTG GGGATGCTGG 3710 

GAGTGAGTGA GTGAGATGGC TGAGTGAGGG TTATGGGGAG CCTGAGGTTT TATGGGCCTG TGTATCCCCT 37 8 0 

TCTCCCGGCC CCAGCCTGCC TCCCTCCTGC CCGCCTGGCC CACAGGTCTC CCTCTGGTCC CTGTCCCTCT 3850, 

GGTGGTTGGG* GATGGAGCGG CAGCAAGGGG TGTAATGGGG CTGGGTTCTG TCTTCTACAG GCCACCCCGA 3 92 0 

GGTCCTCAGT GGTTGCCTGG GGAGCCGGAC GGGGCTCCTG AGGGGTACAG GTTGGGTGGG CCCTCCCTGA 3990 

GGGTCTGGGG TCAGGCTTTG GCCTCTGCTG CCTCTCAGTC ACCAAGTCAC CTCCCTCTGA AAATCCAGTC 4 0 60 

CCTTCTTTGG ATGTCCTTGT GAGTCACTCT GGGCCTGGCT GTCGTCCCTC CTCAGCTTCT TGTTCCTGGG 4130 

ACAAGGGTCA AGCCAGGATG GGCCCAGGCN TGGGATCCCC CACCCCAGGA CCCCACAGGC CCCCTCCCCT. 4 2 00 

GNTGNTTTGC GGGGGGCAGG GCAGAAATGG ACTCCTTTTG GGTCCCCGAG GTGGGGTCCC CTCCCAGCCC -4270 

TGCATCCTCC GTGCCCTAGA CCTGCTCCCC AGAGGAGGGG CCTTGACCCA CAGGAAGTGT GGTGGCGCCT 4 34 0 

GGCAATCAGG GACCCCCAGC TGCCGCAGCC CTGGTTTTTG GCGCATCTTT TCCCTCTTGT CCCGAAGATT 4 410 



CTGCAAGTCG GAGCTGCCCC GGTTCCAGGA GCTGATCTTC GAGGACTTTG. CCAGGTTCAT CCTGGTGGAA 2380 
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TGCGCCTTTA GTGCCTTTTG AGGGGTTCCC ATCATCCCTC CCTGATATTG TATTGAAAAT ATTATGCACA 4 4 80 
CTGTTCATGC TTTTACTAAT CAATAAACGC TTTATTTAAA AAAAAAAAAA AAA 4 533 
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Figur 4 - Human STR_50E1 - SEQ ID NO:4 

Predicted polypeptide of short splice variant 
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